Skip to main content

All Questions

Tagged with
0votes
0answers
19views

CLIPSeg: no change in performance metrics with a better convolutional decoder

I am training CLIPSeg on the Oxford IIIT pet dataset for semantic segmentation (3 classes: background, cat, dog). In short, what I do is I stick a decoder on the CLIP encoder. The encoder outputs: ...
Stan's user avatar
0votes
0answers
31views

Why is training for the game 2048 not working well for me?

So i started training neural networks for the game 2048. Here is my code for using DQN algorithms to train. The gymnasium env code should be OK, so im not putting it here, but if you need, here is the ...
SuXY's user avatar
0votes
0answers
60views

How to solve the issue with getting free ports in Pytorch DDP?

I am facing issues with getting a free port in the DDP setup block of PyTorch for parallelizing my deep learning training job across multiple GPUs on a Linux HPC cluster. I am trying to submit a deep ...
Shataneek Banerjee's user avatar
0votes
0answers
68views

Convert sciBERT to GGUF

I want to use the SciBERT weights in Ollama. Ollama accepts GGUF format, whereas SciBERT is in another format. I downloaded SciBERT from this huggingface link. I tried to convert it with llama.cpp ...
Igor Popov's user avatar
2votes
0answers
109views

Solving the 2D Helmholtz Equation over a sphere with a PINN

Background: I am working on implementing solutions to various partial differential equations through Physics-informed neural networks (PINNs). I have started working on the Helmholtz equation and was ...
Axion004's user avatar
2votes
1answer
124views

Custom Loss Function Traps Network in Local Optima

I am working with a feedforward neural network to fit the following simple function: N(1) = -1 N(2) = -1 N(3) = 1 N(4) = -1 But I don't want to use the Mean-...
Andrew Baker's user avatar
0votes
0answers
38views

PPO with multiple actions per action vector

I would like to have the following vector for example [0.2,0.6,0.3,0.4,0.8] end up looking like this after training [0,1,0,0,1]. In other words , rather than choosing one action, I'm choosing more ...
Tofara Moyo's user avatar
0votes
0answers
59views

PyTorch and Opacus for Differential Privacy

When testing an example code from the TensorFlow website using Jupyter Notebook, which is available at the following link: [LINK_1], I encountered an error. You can find my question about that error ...
Questioner's user avatar
0votes
0answers
22views

Spikes in Loss During Training both train/val datasets with LSTM

I'm seeing good results I think, but I want to understand why these spikes in loss are occuring. As you can see, it would appear that my training is working as it should, but every 200 or so epochs ...
Romuloux's user avatar
0votes
0answers
61views

SSIM in validation higher then SSIM in training for image denoising

I'm working to denoise microscopy images using a 2D U-Net. I'm training my network on images taken at different z-levels, and these images have ground truth, which is the mean of the images in z. ...
Fab G's user avatar
0votes
1answer
148views

How to Create a 1D Embedding from Tensors of Varying Sizes?

I am a newbie in AI and playing with some computer vision algorithms. I have three tensors with different sizes. Noise augmentation levels tensor with size (N, C, H, W), diffusion timestep tensor of ...
Pooya Kamranjam's user avatar
1vote
2answers
106views

How to Represent Boardless Board Game as Input to RL Model?

I am currently doing my thesis project by creating an Imitation Learning (IL) agent that learns to play the board game Hive, which lacks a traditional 2D board. Pieces are placed relative to one ...
Johnny McKenzie's user avatar
2votes
1answer
1kviews

How do I start implementing a deep learning paper from scratch?

I am fairly new to deep learning. I want to implement a deep learning paper from scratch with proper data preprocessing, model, losses etc., using an object-oriented approach in python. I want to do ...
programmer_04_03's user avatar
1vote
0answers
482views

Where can I get Imagenet test dataset labels for evaluation

I have the imagenet train, validation and test set. I have been able to assign each image in the validation set into its respective class folders with the help of some online resources. However, for ...
Craving_gold's user avatar
0votes
1answer
164views

How to improve the performance when no shuffling of dataloader is needed?

I'm currently doing some researches on video recognition. What I'm trying to do is like this paper. The idea is that: for processing a specific input video clip (shape: [T, C, H, W]), it needs ...
Henry's user avatar

153050per page
close